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One of the most common causes of functional frailty is major depressive 
disorder (MDD). MDD is a chronic condition that requires long-term 
therapy and professional assistance. Additionally, MDD effective treatment 
requires early detection. Unfortunately, it has intricated clinical 
characteristics that make early diagnosis and treatment difficult for 
clinicians. Furthermore, there are currently no clinically effective diagnostic 
biomarkers that can confirm an MDD diagnosis. However, 
electroencephalogram (EEG) data from the brain have recently been used to 
make a quantitative diagnosis of MDD. In addition, As being among the 
most cutting-edge artificial intelligence (AI) technologies, deep learning 
(DL) has exhibited superior performance in a wide range of real-world 
applications, from computer vision to healthcare. However, an additional 
challenge could be the extraction of information from the ECG raw data. 
This paper presents a method for converting EEG data to power spectral 


density (PSD) images, and then they were classified as healthy or MDD 
using a deep neural network for feature extraction and a machine learning 
(ML) classifier. When employing the proposed approach, the images formed 
from the PSD show a considerably improved performance in classification 
results. 
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1. INTRODUCTION 

Depression and other mental illnesses are significant public health concerns because they contribute 
significantly to the global disease burden and considerably impact people's social and economic [1]. 
According to the World Health Organization (WHO), mental diseases are expected to be the leading cause of 
disability worldwide by 2020. The WHO implies that around 264 million individuals of all ages are believed 
to have some sort of mental illness, indicating that up to 27% of the general population will experience 
mental health disorders at some point in their lives. Depression symptoms include despair, loss of interest and 
focus, physical malfunction, insomnia, guilt, difficulty making decisions, and suicidal thoughts [2], [3]. 
Nevertheless, the precondition for treatment is the requirement for an accurate diagnosis [3]. 

Mentally problems are typically diagnosed through an individual's self-report of responses to 
specific questions helps to identify specific types of emotion or social interaction, in contrast to other 
physical illnesses diagnosed through laboratory results and assessments [4]. Unfortunately, half of the 
depressed patients are either ignorant of their condition or misdiagnosed (WHO 2017). Therefore, 
electroencephalogram (EEG) is extensively used in medical settings. Due to its high temporal resolution, 
economical, intrusive method, and ease of setup [5]-[9], EEG is a valuable technique for diagnosing mental 
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illnesses such as depression [10]-[13]. The EEG system collects data from the many locational areas of the 
brain. However, the raw EEG signal is complex due to its temporal and spatial fluctuation [14]. Therefore, in 
major depressive disorder (MDD) classification, specific subject knowledge is necessary for traditional hand- 
crafted feature extraction. Traditional data preprocessing, on the other hand, expenses quadratically increase 
as the number of features increases [15]. Therefore, mature extraction research has concentrated on the time 
[16]-[21], frequency [22]-[24], or the EEG signal's time-frequency components [25], [26], with little attention 
paid to the spatial dimension. 

As more data on an individual's mental health status becomes available, AI and machine learning 
techniques have been used to increase our understanding of mental health issues and aid mental health 
clinicians in making more informed therapy decisions [27]-[29]. Deep learning (DL) is a technique for 
automatically extracting meaningful information or patterns from data through multilayered neural networks 
[28]. Many notable advancements in DL approaches include Autonomous vehicles, natural language 
processing, face detection, synthesis of text-to-speech, transcription of handwriting, medical analysis, 
personal digital assistants, and recommender systems [30]. Convolutional neural networks (CNNs) have 
proven superior in various applications. Object recognition, medical diagnosis and prognosis, classification 
algorithms, and clustering are just a few examples [30], [31]. Furthermore, recent research has shown the 
importance and value of deep learning architectures in neurology, mainly when using neuroimaging 
data [32]. Numerous studies in the recent literature in brain-computer interfaces (BCI) have established the 
reliability, durability, and trustworthiness of the EEG-based technique in combination with DL. Particularly 
useful in the classification of motor imagery tasks [33]-[36], the prediction and detection of epileptic 
seizures, the prediction and detection of driver fatigue [37], the classification of emotion and affective states 
[21], the detection of sleep stages [38]. 

According to recent research, the CNN approach is particularly effective in tasks like feature 
extraction and classification. However, using CNN for feature extraction and traditional classifiers (such as 
support vector machine (SVM), k-nearest neighbor (KNN)) for classification would be more effective for a 
small dataset. The scope of this study is to extract Power spectral density (PSD) from a multichannel EEG 
signal and convert it into images to characterize mental states accurately. Therefore, to classify EEG data, a 
data mining methodology was built. First, the features are extracted into a new space using a CNN. After 
that, based on the features gathered, a prediction model was created to identify patterns in the data. The main 
contributions of this work are as shown in: 

- Propose an end-to-end model to diagnose depression using row ECG data after using PSD images 
without any complex converting process costing time and computational power. 

- The proposed method was tested using EEG data in three cases, eyes open, eyes closed, and task given. 

- A comparison contacted with similar methods on the same data proved that the proposed method 
reached the highest value in terms of accuracy, sensitivity, f-1 score, and AUC. 

The article's content is divided into the following sections: section 2 demonstrates the related work, 
section 3 discusses the data collection, representation, and methods presented. Sections 4, 5 summarizes the 
results of this study and compares them to those of other recent studies. Finally, section 6 concludes the paper 
and makes recommendations for future research. 


2. RELATED WORKS 

Historically, to diagnose depression, a variety of EEG-based ML techniques were used. For 
example, [39] analyzed the 5-minute EEG signals of 30 patients using relative wavelet energy, entropy, and 
an artificial neural network (ANN) [40]. Studied the frontal brain waves of 12 depressed individuals and 12 
healthy subjects using Higuchi and Katz Fractal measures. In addition, Ahmadlou et al. [41] conducted 
experiments on 22 MDD patients and developed a novel nonlinear approach: spatial-temporal analysis of 
EEG signals' relative convergence [42]. Used a variety of nonlinear approaches such as detrended fluctuation 
analysis and greatest Lyapunov to determine the degree of signal complication. 

Additionally, researchers used correlation dimension and Welch's power spectral density to assess 
power in specific frequency bands [43]. Extracted relevant characteristics from 15 healthy and 15 depressive 
individuals combining haar wavelet decomposition with various entropies [3]. Diagnosed depression in 37 
university students using an event-related potential approach. Two alternative feature selection procedures 
have been used to discover the more effective characteristics for classifying the two groups: greedy stepwise 
and genetic search [44]. Evaluated characteristics using linear parameters from EEG signals, band power, and 
alpha interhemispheric imbalance to determine accuracy [45]. Classified 34 participants into depressive and 
healthy groups using the spectral asymmetry index and detrended fluctuation analysis approaches. 

Numerous studies on brain signal categorization have used various techniques to convert EEG 
signals to visual representations. One study, for instance, converted EEG time series to 2D images using the 
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short-term Fourier transformation approach then classified EEG motor imagery signals using a combination 
of 1D CNNs and stacking autoencoders [46]. This technique outperformed the winning algorithm by 9%. In 
addition, [47] developed a new format for EEG signals that maintain the pattern of EEG data over a wide 
range of space, time, and frequency. The spectral power within three principal frequency bands was 
determined for each point by projecting three-dimensional electrode sites onto a two-dimensional surface 
using azimuthal equidistant projection. This two-dimensional surface was then utilized to create 
topographical maps, which were combined to create three-channel views. Next, the samples were identified 
using these three-channel images fed into a deep convolutional recurrent neural network. Mohammadi et al. 
[48] the EEG data of 53 MDD and 43 healthy subjects were classified using a data mining algorithm. The 
features were first mapped into a new feature space using linear discriminant analysis, and then the most 
significant characteristics were determined using a genetic algorithm. Then, using a decision tree classifier, a 
prediction model was constructed to identify rules and patterns in the data based on the decreased and 
mapped data. The model achieved an identification rate of 80% using this strategy, which significantly 
decreased the overall number of features needed. 

Li et al. [49], EEG data from 37 participants were obtained using a 128 channel HydroCel Geodesic 
Sensor Net in an experiment. Support vector machine (SVM), Bayesian Network, random forest (RF), 
logistic regression (LR), and k-nearest neighbor (KNN) techniques were used to differentiate between MDD 
patients and healthy controls. In addition, a feature selection approach, to achieve the best results, a stepwise 
greedy algorithm for the beta frequency band relying on correlation feature selection and a KNN were 
applied. The accuracy was 92% and 98%, respectively, while the area under the curve (AUC) was 0.957 and 
0.997. However, most investigations focused on features engineering and classification improvement with 
limited outcomes, as indicated by the previous research, and finally, precisely diagnosing this mental disease 
remains a complex undertaking. 


3. MATERIALS AND PROPOSED METHOD 
3.1. Dataset 

The dataset utilized in this study was supplied by [44] and is freely available online 
(www.figshare.com). Furthermore, the ethics committee of Hospital University Sains Malaysia approved the 
methodology. 34 MDD ranging from (27-53) years (average=40) and 30 healthy volunteers ranging from 
(22-53) years (average=38) were recorded. According to the diagnostic and statistical manual IV, the 
depressed group met the criteria for participation in the study (DSM-IV) [50]. Figure 1 shows (a) EEG 
samples for MDD and (b) healthy individuals. 
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Figure 1. EEG samples from the dataset (a) MDD and (b) healthy 


3.2. Acquiring and preparing data 

The EEG data were acquired (with the eyes closed, eyes opened, and task given) for 5 minutes using 
a 19-channel EEG cap placed on the head following the international standard electrode position scheme of 
10-20. The sampling rate of the EEG was set to 256 Hz. Figure 1 illustrates a sample of EEG data from 
healthy and depressed participants. 
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3.3. Power spectral density (PSD) 

Since the amount of EEG data is constrained, a parametric method is the only way to approximate 
the actual spectrum. Therefore, The AR Burg approach is used to calculate the PSD. We employ a sliding 
Hamming window with 256 data points (250 milliseconds) and 128 data points overlap (125 milliseconds) to 
improve spectrum estimation performance. The spectrum estimation process is divided into two phases. First, 
with a specified data sequence, estimate the model-based technique's parameters x(n) O0<n<N-1. Second, 
using such estimates, calculate the PSD. 

Essentially, the AR technique works by stimulating the input data x(n) as the outcome of a discrete 
causal filter with white noise for the input, as defined in (1): 


x(n) = — Dp ak) + x(n—k) + w(n), a) 


where a(k) denotes the AR coefficient, n is white noise with a variance a”, and p denotes the AR model's 
order. The recursive Burg approach is used to estimate AR coefficients in this study, which reduces forward 
and backward error rates [51]. PSD estimate is obtained from the Burg method's calculation of AR 
parameters as defined in (2): 


êp 
[1+ Xp; Ge 27k]? 


Pf) = (2) 


where ê, denotes the sum of the least-squares errors. The Akaike information criterion is used to calculate 
the model order p of the AR technique. 
The model order is set at p=10 throughout this study. After calculating PSD for raw ECG data, 


images of these plots were used as input of the CNN model for feature extraction. Figure 2 shows (a) PSD 
plots for each healthy and (b) PSD plots for MDD. 
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Figure 2. PSD plot samples (a) healthy and (b) MDD 
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3.4. Feature extraction 

The MobilenetV2 network is one of the most widely utilized and lightweight CNNs [51]. It is 
developed for images and may be used for both classification and feature extraction. It employs an inverted 
residual and a linear bottleneck. Figure 3 illustrates how the blocks are stacked together. It starts by using 
1x1 point convolution to extend input channels. The input feature is then extracted using deep convolution, 
and the convolution aggregator is used to aggregate the output while maintaining the network size low 
linearly. Finally, it replaces the ReLU6 with a linear function to adjust the output channel size to the input. 


Conv 1x1 
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Figure 3. MobilNetV2 model 


The reverse block is used by the MobilenetV2 model to obtain additional functionality by 
combining features over layers. Furthermore, the input channels and filters are separated into separate 
channels using depth convolution, using a 1x1 kernel to combine the outputs. Due to its low number of 
parameters compared to other regularly used CNN models (as shown in Table 1), MobileNetV2 is targeted 
for usage with mobile or low-cost devices. 


Table 1. Number of trainable parameters for some well-known models 


Model No. trainable parameters 

Xception 22,910,480 

VGG16 138,357,544 

ResNet50 25,636,712 
InceptionV3 23,851,784 
InceptionResNetV2 55,873,736 

MobileNet 4,253,864 

MobileNetV2 3,538,984 


The MobileNetV2 is a reliable network with a quick response time. This work used the MobileNetV2 pre- 
trained model (ImageNet-based) and generated 250,880 features from each image. 


3.5. Classification 

The suggested model is presented to classify EEG data for MDD and healthy samples accurately. 
The features extracted by MobileNetV2 are classified using well-known classifiers (RF, SVM, and LR). 
Figure 4 shows a schematic representation of the proposed method. 
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Figure 4. The proposed method structure 
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The proposed method has been applied and evaluated using the following explained performance 
metrics to show the superiority of our method. The working steps are explained: 
Step 1: read the EEG file from the dataset for eyes opened EO, eyes closed EC, and given task case. 
Step 2: use PSD for each EEG sample and make a plot image. 
Step 3: feature extraction using MobileNetV2. 
Step 4: split data obtained from step 3 into train and test (54 samples for train and 10 for test equally 
distributed between MDD and healthy). 
Step 5: use proposed classifiers. 
Step 6: evaluate the results using performance matrices. 
All processing, feature engineering, and data classification steps were implemented in Keras library and 
Python 3.8 on a computer with Intel(R) Core (TM) i7-10750H CPU @ 2.60GHz with 2.59 GHz 32.0 
Gigabyte of RAM, and a display adapter from NVIDIA GeForce GTX 1650 Ti. 


3.6. Performance matrices 

The model's performance on the testing set was examined using five performance parameters in this 
study: accuracy rate, sensitivity, Fl-Score, precision, and area under the curve (AUC). AUC is the likelihood 
of the model correctly categorizing a random positive image sample over a random negative image sample. It 
is determined using different thresholds to calculate the true positive rate (TPR) and false positive rate (FPR). 
As shown in (3)-(6) indicate the first four performance matrices: 


TP +TN 


Accuracy = ee RET 5 
“ge . TP 

Sensitivity = rea r 

Precision = 2b P 
TP + FP 

Fi = Score = 7 y Eredsion X Sensitivity) 5 


(Precision+Sensitivity) ” 


where TP (true positive) denotes the number of MDD individuals successfully classified, TN the number of 
healthy people (true negative) individuals accurately classified, FN represents the number of MDD patients 
who were incorrectly classed as healthy FN (false negative), and FP (false positive) is the number of healthy 
people who have been misclassified with MDD. 


4. RESULTS 

The test data accuracy, sensitivity, precision, fl-score, and AUC for features extracted obtained 
from MobileNetV2 model by each classifier are given in Table 2. The evaluations were calculated using EEG 
taken with eyes opened, eyes closed, and while task given for the volunteers. The test data contains five 
MDD and five healthy samples, which the model did not have been trained on to prove the generalization 
ability of the method. 


Table 2. Performance matrices for each classifier in different cases 


Classifier Accuracy Sensitivity Precision _Fl-score AUC 
RF while eyes opened 0.7000 0.7000 0.7083 0.6970 0.70 
RF while eyes closed 1.0000 1.0000 1.0000 .0000 1.00 
RF while task given 0.9000 0.9000 0.9167 0.8990 0.90 
SVM while eyes opened 0.8000 0.8000 0.8000 0.8000 0.80 
SVM while eyes closed 1.0000 1.0000 1.0000 .0000 1.00 
SVM while task given 0.8000 0.8000 0.8571 0.7917 0.80 
KNN while eyes opened 0.9000 0.9000 0.9167 0.8990 0.90 
KNN while eyes closed 1.0000 1.0000 1.0000 .0000 1.00 
KNN while task given 0.7000 0.7000 0.7083 0.6970 0.70 
LR while eyes opened 0.8000 0.8000 0.8000 0.8000 0.80 
LR while eyes closed 1.0000 1.0000 1.0000 .0000 1.00 
LR while task given 0.9000 0.9000 0.9167 0.8990 0.90 
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Figure 5 shows the confusion matrix for each model with each case using test data. Figure 5(a) 
random forest, Figure 5(b) SVM, Figure 5(c) KNN, Figure 5(d) LR it is noticeable that the best results were 
obtained for EEG data taken while eyes closed. However, the results were identical for all classifiers, too, 
with 100% for each performance matrices. Moreover, the results of EEG data while a task is given are better 
than the results while eyes opened in general. 
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Figure 5. Confusion matrix for each classifier in different cases, (a) random forest, (b) SVM, (c) KNN, 
and (d) LR 
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5. DISCUSSION 

Depression screening is critical for early diagnosis and treatment. However, artificial intelligence 
techniques can circumvent this barrier and enable widespread use without highly skilled professionals. Our 
method successfully employed deep learning and machine learning classifiers to detect MDD patients and 
healthy controls. When applied to 19 channels of EEG signals with closed eyes, the MobilNetV2 architecture 
with a classifier achieves a precision of 100%. One of this work's primary innovations is its use of PSD plot 
images obtained from EEG data and classification is made after feature extraction, and this method is tested 
using data obtained while eyes closed, eyes opened, and given a task to the participants. There is no need for 
complex pre-processing steps for feature extraction in this method, significantly reducing the running time. 
However, the classifier cannot analyse the PSD plot images. While using light-weighted MobileNetV2 for 
feature extraction enabled the classifiers to make perfect results. 

In Table 3, the findings of this study are compared to those of recently published comparative 
research that used EEG from the same and a different dataset. As can be shown, the accuracy obtained in this 
study is higher than that obtained in previous studies using classic machine learning approaches for linear and 
nonlinear feature extraction, demonstrating the suggested method's superiority. Furthermore, these results are 
more accurate than those obtained using alternative DL techniques on EEG time series data. As a result, as 
shown in Table 3, this work obtained the highest results in the automated detection of depressive and healthy 
people to date. This study's primary limitation is the size of the dataset used to train the classifier. However, 
by conducting features extraction using the pre-trained MobilNetV2 network, we overcame this limitation. 


Table 3. Results comparison with similar works 


Study Method Accuracy Sensitivity 
Acharya et al. [51] SVM 0.98 0.97 
Mumtaz et al. [44] Naive Bayes, LR, SVM 0.98 0.96 
Acharya et al. [52] 1Dimentional -CNN 0.95 0.95 
Ay et al. [53] CNN- long short-term memory (LSTM) 0.99 0.98 
Mumtaz and Qayyum [54] CNN-LSTM, 1!Dimentional-CNN 0.98 0.98 
Suggested research CNN- RF, SVM, LR, and KNN 1.00 1.00 


6. CONCLUSION 

This study conducted a detailed investigation using a CNN with a pre-trained model (MobilNetV2) 
and many well-known machine learning algorithms for feature extraction (RF, SVM, LR, and KNN). As a 
result, the highest accuracy of 100% in classifying MDD and normal was reached for eyes closed individuals. 
Relying upon the outcomes, the newly released deep learning model can effectively assess brain EEG data 
and give the best findings compared to all current studies. As a result, suggested technologies can assist 
health care providers in identifying MDD patients for early identification and prevention. In addition, this 
method can be adapted for future work to classify other EEG-based diagnoses of various neurodegenerative 
diseases, such as schizophrenia, Parkinson's disease, and Alzheimer's disease. 
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